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TECHNICAL FIELD 

This invention relates to parity operations in redundant disk drive systems, 
and particularly to parity operations in such systems that utilize two or more parity 
segments per storage stripe. 

BACKGROUND OF THE INVENTION 



Modem, high-capacity data storage-^stems often utilize a plurality of 
physical disk drives for redund^aTstorage of data. This arrangements speeds data 
access as well as protpeting against data loss that might result from the failure of 
any single disk/^ 

There are two common methods of storing redundant data. According to 
the first or "mirror" method, data is duplicated and stored on two separate areas of 
the storage system. In a disk array, for example, identical data is stored on two 
separate disks. This method has the advantages of high performance and high data 
availability. However, the mirror method is also relatively expensive, effectively 
doubling the cost of storing data. 

In the second or "parity" method, a portion of the storage area is used to 
store redundant data, but the size of the redundant storage area is less than the 
remaining storage space used to store the original data. For example, in a disk 
array having six disks, five disks might be used to store data, with the sixth disk 
being dedicated to storing redundant data, which is referred to as "parity" data. 
The parity data allows reconstruction of the data from one data disk, using the 
parity data in conjunction with the data from surviving disks. The parity method is 
advantageous because it is less costly than the mirror method, but it also has lower 
performance and availability characteristics in comparison to the mirror method. 
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One aspect of this invention involves storing redundant data according to 
parity techniques. In conventional disk arrays utilizing parity storage, the space on 
the storage disks are configured into multiple storage stripes, where each storage 
stripe extends across the storage disks. Each stripe consists of multiple segments 
of storage space, where each segment is that portion of the stripe that resides on a 
single storage disk of the disk array. 

Fig. 1 illustrates a conventional disk array 12 having six storage disks 13. 
In this simplified example, there are five storage stripes extending across the 
storage disks. Fig. 1 highlights data and storage segments of a single one of these 
five stripes. Data segments of the indicated stripe are indicated by cross-hatching. 
The corresponding parity segment of this same stripe is illustrated in solid black. 
Generally, of the six segments comprising any given stripe, five of the segments 
are data segments and the sixth segment is a parity segment. 

This type of parity storage is referred to as 5 + 1 parity storage, indicating 
that there are five data segments for every single parity segment. This scheme is 
more generally referred to as N + 1 grouping, where N is the actual number of data 
segments in a data stripe. 

N + 1 redundancy grouping such as illustrated in Fig. 1 protects against the 
loss of any single physical storage device. If the storage device fails, its data can 
be reconstructed from the surviving data. The calculations performed to recover 
the data are straightforward, and are well-known. Generally, a single parity 
segment P is calculated from data segments D 0 through D N _i in accordance with 
the following equation: 

P ~ Xq "t~ X i X2 -^N- 1 
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where x 0 through x N .\ correspond to the data from data segments D 0 through D N _i. 
After the loss of any single data segment, its data can be recovered through a 
straightforward variation of the same equation. 

In many systems, however, it is becoming important to protect against the 
loss of more than a single storage device. Thus, it is becoming necessary to 
implement N + 2 grouping in redundant storage systems. 

While N + 2 redundancy grouping enhances data protection, it also involves 
more complex calculations — both in initially calculating parity segments and in 
reconstructing any lost data segments. 

A general form of the N + 2 parity computation is as follows: 

P ~PoXq + P[X\ + /?2*2 + j^N-l-XN-l 
Q = q&O + #1*1 + #2*2 + <?N-1*N-1 

where: 

P is the value of a first parity segment; 

Q is the value of a second parity segment; 

x 0 through x N _i are the values of the data segments 

p 0 through /? N _! and q 0 through # N _i are constant coefficients that are 
particular to a given parity scheme. 

These equations form a two-equation system that, by the rules of linear 
algebra, can potentially solve for any two unknowns x a through x^ which represent 
the data from a single stripe of any two failed storage devices. One requirement is 
that the two sets of coefficients /?, and q t be linearly independent. This 
requirement is met, for example, ifpo = 1 ?j Pi = l>/>2 = 1; etc -i an ^ <?o = 1> <7i = 2, #2 
= 3; etc. Other examples are also possible. 
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The mathematics of N + 2 parity are well-known and are not the primary 
subject of this description. However, it is apparent from the brief description 
given above that N + 2 parity computations are significantly more complex than N 
+ 1 parity computations. In actual implementations of N + 2 disk arrays, this 
complexity threatens to limit the data throughput of storage device controllers and, 
consequently, of the overall disk array. 

This invention includes methods and means for maintaining adequate data 
throughput in spite of the added complexity resulting from N + 2 parity 
calculations. 

SUMMARY 

In accordance with one aspect of the invention, every possible parity- 
related computation is identified as a different scenario. A coefficient subset is 
selected or computed for each different scenario and stored in a memory table 
during an initialization process. To perform a particular operation, its scenario is 
identified and the corresponding coefficient subset is located. Hardware logic is 
then instructed to perform the actual parity operation, using the identified 
coefficient subset. This allows very efficient computations, using coefficients that 
are computed and selected ahead of time. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram showing N+l redundancy grouping in accordance 
with the prior art. 

Fig. 2 is a block diagram showing N+2 redundancy grouping as used in the 
described embodiment of the invention. 
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Fig. 3 is a block diagram illustrating layout of a memory table in 
accordance with the invention. 

Fig. 4 is a flowchart illustrating preferred steps in accordance with the 
invention. 

Fig. 5 is a block diagram showing pertinent components of a disk controller 
in accordance with the invention. 

DETAILED DESCRIPTION 
Parity Operations 

Referring to Fig. 2, a redundant data storage system 20 in accordance with 
the invention utilizes storage disks 22 with data stripes 24. Each data stripe 24 
comprises a plurality of data segments x 0 through x N _i and at least two 
corresponding parity segments P and Q. P and Q are derived from the data 
segments xo through jt N _i, from a first set of parity coefficients po through p^\, and 
from a second set of parity coefficients q 0 through # N _!. The parity coefficients 
correspond to respective data segments in accordance with the equations below: 

P —poXo + P\X\ + P2X2 + ^N-l-^N-l 

Q = qoX 0 + <7i*i + qix 2 + ?n-i*n-i 

In accordance with the invention, parity operations are generally classified 
as parity segment generation operations, parity segment regeneration operations, 
and data segment reconstruction operations. 

A parity segment generation operation is performed when creating a new 
data stripe — the parity segments are created based on completely new data. 

A parity segment regeneration operation is performed with respect to an 
existing stripe, either when new data causes the addition of a new data segment or 



10971442-1 



5 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 




when a read/modify/write cycle modifies one or more data segments. In a parity 
segment regeneration operation, the parity segments are modified incrementally, 
without reading an entire data stripe. For example, suppose that new data causes 
the addition of a new data segment # 4 . Pnew is calculated as follows: 

PnEW = POLD + P4*4 

Similarly, suppose that data segment x 2 is modified as the result of a 
read/modify/write cycle. In this case, Pnew is calculated as follows: 

Pnew = Pold - Pi^ioud + / ? 2^2new 

Calculating new P and Q values from the old data segment values involves 
significantly fewer memory reads than calculating P and Q from scratch after 
every stripe modification. 

In accordance with the invention, parity segment regeneration operations 
are further classified as either parity regenerations resulting from added data 
segments or parity regenerations resulting from a modified data segment. 

Data segment reconstruction operations include two sub-classifications: 
single data segment reconstruction operations and double data segment 
reconstruction operations. A single data segment can be reconstructed from either 
the P or the Q parity segment in combination with the surviving data segments. 
Generally, a data segment x a is reconstructed from either parity segment P or Q as 
follows: 

X a = f(P0,Pa)X0 + f(p\,Pa)Xi + . . . + f(p a )P + . . . + f (?N- b /^)*N- 1 
X a = %0, ? fl )Xo + ffal, q a )X\ + . . . + ffo,)Q + . . . + f(?N-l, qa)XN-l 

where f( ) is a transformation function that generates an appropriate coefficient 
particular to the parity generation code being used. 

One implementation of these equations is as follows: 
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*a = Pa' (P0*0 + P\M + . . . + P + . . . + A*-l*N-l) 
x a = Pa' (tfoXo + ?*l + ..- + Q + ...+ 0N-l*N-l) 

Two data segments can be reconstructed from the P and the Q parity 
segments in combination with the surviving data segments. Generally, two data 
segments x a and jc^ are reconstructed from parity segments P and Q as follows: 

x a = f(Po> qo, Pa, Pb, q a , ?*) x o + f(f>i, qu Pa, Pb, q a , qb)x\ + - • - + f(p a , Pb, Qa, 
gb)P + ffpt, q** Pa,Pb, q a , qb)x k + fOvt+i, qk+u Pa, Pb, qa, qb)Xk+\ + . . . + 

£{p a ,Pb, qa, qb)Q + . . . + f(PN-l, q^A,Pa,Pb, q a , qb)XN-\ 

x b = f(po, qo,Pa,Pb, q a , qb)*o + f (pi> q\,p a ,Pb, q a , qb)x\ + f(p a ,Pb, q a , ^)P + 

$(Pk, qu, Pa, Pb, q a , qb)x k + f(p*+i, q^u Pa, Pb, q a , qb)x k +\ + . . . + f(p a , 

Pb, qa, qb)Q + . . . + f(PN-U qN-\,Pa,Pb, q a , #&)*N-1 

where f( ), again, is a transformation function that generates an appropriate 
coefficient particular to the parity generation code being used. 

One imp lementation of these equations is as follows: 

x a = (p, qb + Pb qa)' 1 ((qbPo+Pb qo)x 0 + {qoPi + Po qi)x\ + . . . + qfi + . . . + 
(qbPk+Pb qk)Xk + (qbPk-i +Pb qk+i)x k +\ + . . . +PbQ + . . . + (qbPN-j + 

Pb<lN-l) X N .j) 

Xb = (Pa qb + Pb qa)' 1 {(q a Po + Pa qo)x 0 + {q a Pi + Pa qi)x\ + . . . + q a P + - - - + 
(q a Pk+Pa qk)Xk + feft-y +p a qk+i)x k+ \ + . . . + A*Q + . . . + iqaPN-i + 

Pa<lN-l) X N .j) 

Generally, all ofthe parity operations described above can be accomplished 
by using a different combination of known coefficients chosen from a base set 
having a finite number of such coefficients. These coefficients include po-p^-u qo~ 
q^-\, and the coefficients resulting/ from the transform functions f( ). Any 
particular parity operation utilizes a subset of these coefficients, depending on the 



10971442-1 



7 



actual data or parity segment being calculated. The particular subset of 
coefficients needed for a particular calculation^ depends on both the classification 
of the operation and upon the specific dat^mid/or parity segments involved. Thus, 
within a given classification of parity ^operation, there are different situations or 
scenarios, each of which calls for a/lifferent subset of coefficients. For example, 
one scenario occurs when addingydata segment x$ to a stripe, when coefficients p 5 
and q 5 are needed. Another scenario occurs when adding data segment to a 
stripe, when coefficients p$ ana q 6 are needed. 

Coefficient Subsets 

j^^> Fig. 3 shows a memory array 3^/that contains plurality of coefficient 
subsets 3 1 . Each coefficient subset is a list or concatenation of pre-selected and/or 
pre-computed coefficients that are applied to corresponding segments of data to 
produce a parity computation result In accordance with the invention, a different 
subset of coefficients is pre-selected and stored for each different operation 
scenario. The subsets are then formatted and packed in a linear memory array for 
reference and direct use by parity operation logic. Because different scenarios call 
for different numbers of coefficients, the subsets are not of the same length or size. 

Each coefficient is a single byte in the described embodiment of the 
invention. The term "packed" means that the subsets or strings of coefficients are 
concatenated in linear memory, preferably with no intervening unused spaces, to 
conserve storage space. 

There is a one-to-one correspondence between a coefficient in a subset and 
a segment of data (either a data segment or a parity segment) when performing the 
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parity operation. Each coefficient is applied only to its corresponding data 
segment or parity segment to produce the result of the operation. 

One coefficient subset is included in the array for each possible parity 
computation case or scenario. Unique indexing formulas are used to locate the 
beginning of a subset in the array for each specific computational scenario. 
Generally, the subsets are arranged in pairs, corresponding to computations 
involving P and Q, respectively. 

Referring to Fig. 3, memory array 30 includes a plurality of classification 
groups 32, 33, 34, 35, and 36, each of which contains the coefficient subsets 31 
corresponding to a particular parity operation classification. Each subset in a 
classification group has coefficients for a specific scenario that occurs within the 
group's classification. With one exception, the coefficient subsets are the same 
size within any given classification group. 

Within array 30, particular classification groups are located by computing a 
group offset from the beginning of the array to the beginning of the group. This 
group offset is the base index into the array for the group. To locate a specific 
coefficient subset within a classification group, a subset offset from the beginning 
of the group is added to the base index. This produces an index into the array that 
locates the beginning of the desired coefficient subset. 

In accordance with one embodiment of the invention, the general parity 
operation classifications are defined as follows: 

1. Parity Generation Operations — Partial or full new stripe that has 
no pre-existing data or parity. 
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2. Parity Regeneration Operations Resulting From Added 
Segments — Incremental growth of a stripe by incorporating new 
data segments into the two parity segments. 

3. Parity Regeneration Operations Resulting From Segment 
Modification — Modification of a data segment that is already 
incorporated in the two parity segments (read/modify/write). 

4. Single Data Segment Reconstruction — Reconstruction of a single 
data segment using one of the parity segments and the surviving data 
segments from the strip. Reconstruction from either P or Q parity 
segments is supported because in the case of two failed storage 
devices, one of the failed storage devices may hold P or Q. 

5. Double Data Segment Reconstruction — Reconstruction of two 
data segments of a stripe using the two parity segments P and Q, and 
the surviving data segments from the stripe. 

Structure of Classification 1 Coefficient Subsets 

The first classification group 32 of the array contains the coefficient subsets 
for parity generation operations. A parity generation operation generates new P 
and Q segments from new data segments x 0 through x N .j. There are only two 
coefficient subsets in this classification group. The subsets correspond 
respectively to the generation of parity segments P and Q: 

P: {po,Pu • . -Pn-i} and 

Q: {q 0 , qu • • - 2n-i} 

Each of these subsets is the same length (N). 



10971442-1 



10 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 




Structure of Classification 2 Coefficient Subsets 

The second classification group 33 of the array contains the coefficient 
subsets for parity operations that add incrementally to a stripe. This type of 
operation updates P and Q segments in combination with any given contiguous 
range of new or added data segments x a through x b (where b < N and a <= b). 
There are multiple different scenarios of these operations, corresponding to every 
possible range a through b of data segments within data segments 0 through N-l. 
Each scenario calls for a different subset of coefficients. For example, if the new 
or added data segments are x 3 and x 4 , the required coefficient subset to calculate P 
is {pi, p 4 }. If the new or added data segments are x 2 through x 5 , the required 
coefficient subset to calculate P is {p2,Pi> P4, Ps}> The total of possible ranges 
within data segments 0 through N-l depends on the value of N. 

Each coefficient subset of classification group 2 contains two initial 
parameters that indicate whether the subset applies to calculations of P or to 
calculations of Q. Each of these initial parameters is set to either "0" or "1". A 
value of "1" for the first of these coefficients indicates that the calculation involves 
parity segment P. A value of "1" for the second of these coefficients indicates that 
the calculation involves parity segment Q. Only one of these two parameters 
should be set equal to "1" at any given time. 

The remaining coefficients in a Classification 2 subset are the sub-range of 
coefficients that are used to regenerate P and Q from newly added data stripes. 
Thus, the classification group contains a plurality of coefficient subsets of the 
form: 

P: {1,0, p a , . . ,p b ) and 
Q: {0, l,0 a , . . .q b } 
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Classification group 33 includes a plurality of subsets such as these, 
depending on N, corresponding to every range of a through 6, within the larger 
range of 0 through N-l. The coefficient subsets in this section of the array have 
varying lengths or sizes, equal to b — a for each operation scenario. 

Within this classification group, coefficient subsets are arranged and 
grouped by length. That is, the coefficient subsets containing the smallest number 
of coefficients are placed in the initial part of the classification group. The 
coefficient subsets containing the largest number of coefficients are placed at the 
end of the classification group. Within each of these groupings, the coefficient 
subsets are arranged in order according to the lower coefficient subscript of the 
range that is covered by the coefficient subset. Thus, the subsets having a = 0 are 
positioned first, the subsets having a = 1 next, and so on. 



Structure of Classification 3 Coefficient Subsets 

The coefficient subsets in the third classification group 34 are used to 
update P and Q when a single data segment is modified. This type of operation 
dates P and Q segments, given a modified data segment x a . 

As with the Classification 2 group, thp^ffrst two parameters of each 
Classification 3 subset indicate whether^thecoefficients of the group are applicable 
to P calculations or to Q calcul^tkJns. Each of these coefficients is set to either "0" 
or "1". A value of "V*£dr the first of these coefficients indicates that the subset 
coefficients appjy^to parity segment P. A value of "1" for the second of these 
coefficient^indicates that the subset coefficients apply to involves parity segment 
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Each subset contains a single remaining coefficient, corresponding to the 
data segment x a that is being modified: 
P: {1,0, p a } and 
Q: {0, l 9 q a ] 

The third classification group 34 includes N pairs of such subsets, 
corresponding to all values of a from 0 through N-l. Note that these subsets 
correspond to a special case of the Classification 2 coefficient subsets, in which a 
= Z>, and can therefore be used when adding a single new data segment to a stripe. 

Structure of Classification 4 Coefficient Subsets 

The coefficient subsets in the fourth classification group 35 are used to 
reconstruct a single data segment x a based on one of the parity segments and the 
surviving data segments. The coefficients correspond closely to the Classification 
1 coefficients, except that they are transformed according to the mathematics (f( )) 
of the chosen error correction code to perform a reconstruction operation: 

P: { f(po,p a ), f(PuPa), • - • , f(Pa), • • , ^N-U Pa) } 

Q: { f(q 0 , q a ), ffau q a \ • • • , - . . , f(?N-i, q a ) } 
More specifically: 

P: (p, qb + Pb q a )' X {{qbPo +Pb qo), (goPi +Po qi), . . ■ , qiP, . . . ,(qbPk + 

Pb qk) , {qbPk-i +Pb qtc+i), • . • ,PbQ + - . . + (qbPN-i + Pb<iN-i)) 
P: (p a qb + Pb q a )' 1 {{qaPo +Pa qo), iflaPi + Pa qi)> . • • ,qaP + . . .+(q a Pk 

+ Pa qk) » (qaPk-J +Pa qk+l), • • • >PaQ, - * , {qaPN-1 + PaQN-l)) 

The fourth classification group includes N pairs of such subsets, 
corresponding to all values of a from 0 through N-l. Note that in each subset, the 
coefficient f(p a ) or f(q a ) corresponds to data segment jt a . 
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Structure of Classification 5 Coefficient Subsets 

The coefficient subsets in the fifth classification group 36 are used to 
reconstruct two data segments x a and x b based on the two parity segments and the 
surviving data segments. The coefficients correspond closely to the Classification 
1 coefficients, except that they are transformed according to the mathematics (f( )) 
of the chosen error correction code to perform a reconstruction operation: 

*a' { f(P0> qO,Pa,Pb, <la, qb), f(Pl ? q\, Pa, Pb, Va, q'b), • • > f(Pa, Pb, <Ja, <Jb), • 

• . , f(Pk, qk,Pa,Pb, q a , qb), f(f>k+i> qk+\,Pa 9 Pb, q a > qb), • • > f(p a ,Po, q a , 

qb), • • > f(PN-i» q^-uPaiPb, q a , qb) } 

x b : { f(p 0 , qo,p a9 Pb, q^ qb), f(Pu quPaiPb, q a > qb), . . . , ?(Pa,Pb, q a , qb), . 

• • > f(Pk» qk,Pa>Pb, q a > qb), $(Pk+u qk+\,Pa,Pb, q a , qb), • • • , f(p a ,Po, q a , 
^ qb), • • • , f(£>N-i> q^-\,Pa,Pb, q a , qb) } 

The sect i° n °f the array includes (N*(N-l))/2 pairs of such subsets, 
corresponding every possible combinatior^)f a and b within the range of 0 to N-l. 
Note that in each subset, the coefficient f(p a , q a9 q^) corresponds to data 
segment x a or x\» depending on which oata segment is being reconstructed. 
One possible implementation of these equations is as follows: 
x a '< (Pa, qb + Pb, qa)~\<lb,Po + Pb, qo), (Pa, qb +Pb, qa)~\<\b,Pi +Pb, qi), • • 

. , (Pa 9 qb +Pb, q a ) A qb, . . . , (p a > qb + Pb, q a )'\^o, Pk+ po, qk), (p a , qb + 
Pb, qa)' x (<\o, Pk+i + po, qk+i), . . . , (Pa, qb + Pb, q a )' l Pb, . . . , ip a , qb + 

Pb, qa)'\^0,PN-l +po, qN-i) 
*b* (Pa, qb + Pb, q a )'\^b,Po +Pb, qo), (Pa, qb + Pb, qa)'\q a Pi +p a , qi), - - 

• > (Pa, qb + Pb, qa)' X qa, . . . ,(Pa,qb + Pb, qa)~X<\a,Pk+Pa, qk), (Pa, qb + 
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Pb> q a y X {<\a, Pk+1 + Pa> qk+l)> • - • , (Pa, qb + Pb, q^Pa* • • • , (Pa, qb + 
Pb> q a y\<\a>PN-l +Pa, qN-l) 



Coefficient Subset Usage 

Fig. 4 illustrates a method of performing parity operations in accordance 
with the array storage scheme described above. A first step 100 comprises 
classifying different parity operations into classifications that include parity 
segment generation operations, parity segment regeneration operations, and data 
segment reconstruction operations. More specifically, an operation is classified as 
either a parity generation operation, a parity regeneration operation resulting from 
added segments, a parity regeneration operation resulting from segment 
modification, a single data segment reconstruction operation or a double data 
segment reconstruction operation. Each classification of parity operations 
includes a plurality of different classification scenarios, each of which involves a 
respective subset of parity coefficients. 

A step 102 comprises pre-calculating individual parity coefficients and pre- 
selecting subsets of parity coefficients for use in the different parity operations and 
the different scenarios of parity operations. This step is performed in accordance 
with the description already given. 

A step 104 comprises storing all of the pre-selected parity coefficient 
subsets in an indexed linear memory array, where they can be accessed by parity 
computation logic. This step includes pre-formatting the coefficient subsets so 
that they can be efficiently utilized by hardware-based parity operation logic. In 
particular, the individual coefficients of each subset are packed in adjacent bytes 
or storage units and ordered in a way that is particular to the hardware-based parity 
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operation logic. As a result of this step, the memory array contains a single 
coefficient subset corresponding to each different computation scenario. 

The individual coefficients and the subsets of coefficients are packed with 
no intervening data elements. The subsets of the array are grouped and ordered as 
already described, with the coefficient subsets grouped into classification groups 
by order of their classifications. Within the second classification group, the 
subsets have varying sizes. In addition, the subsets in the second classification 
group are sub-grouped by size, and ordered in ascending order according to their 
lowest-numbered coefficient. 

During parity operations, parity operation logic accesses the memory array 
to obtain the appropriate coefficient subsets for use in the different scenarios of 
parity operations. Thus, a step 106 comprises determining which of the stored 
subsets of parity coefficients is needed for a particular parity operation. This step 
involves determining the classification of the parity operation and a group offset 
into the linear memory array, indicating the beginning of the classification group 
corresponding to that parity operation classification. A subset offset is then 
calculated into the group, to the location of the desired coefficient subset. 

Step 106 is straightforward except with regard to the second classification 
group. As described in detail above, the second classification group contains 
coefficient subsets of varying lengths or sizes, making it difficult to determine the 
offset of a particular coefficient subset. However, the inventors have discovered 
that when the second classification group is arranged as described, having ordered 
subgroups of same-sized coefficient subsets, an offset to a particular subgroup can 
be calculated as a function of the size of the coefficient subsets of the sub-group 
and of N (the largest number of coefficients contained by any sub-group). 
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Specifically, the offset to a particular sub-group i corresponding to subset size L, is 
equal to : 

((L, - 1)(12N + L,{3N - 2L f - - 5))/6) - 3(N - 1). 
This formula assumes the presence in each subset of the prepended pair of 
constants (described above) corresponding to P and Q. L, however, equals b - a. 
Within the sub-group /, the offset of a particular coefficient subset is equal to <s(L; 
+ 2). Thus, the overall offset into the classification group, for a range of 
coefficients corresponding to x a through is : 

(((Li - 1)(12N + L f <3N - 2L, - 5))/6) - 3(N - 1)) + a(L, + 2). 
The size of the second classification group is given by the following 
equation: 

((N - 1)(12N + N(3N - 2N - 5))/6) - 3(N - 1). 
After determining the appropriate offset into the memory array, a step 108 
is performed of reading the determined parity coefficient subset from memory. 
Step 110 comprises performing the particular parity operation with the subset of 
parity coefficients read from memory. 

Disk Controller Operation 

Fig. 5 illustrates the most pertinent components of a disk controller 200 in 
accordance with the invention. The disk controller includes a microprocessor 201 
and associated memory 202. In addition, the disk controller has a hard disk 
interface component 203 and a communications component 204. The hard disk 
interface component provides a means of access to the hard disks associated with 
and controlled by the disk controller. The communications component acts as an 
interface between a host computer and the hard disk controller. 
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In addition to these components, hard disk controller 200 includes 
hardware-based parity operation logic 205 in the form of an application-specific 
integrated circuit (ASIC). The term "hardware-based" is intended to mean that 
this logic component, as opposed to software-based logic, does not retrieve and 
execute instructions from program memory. Rather, the logic component has 
dedicated, interconnected logic elements that process signals and data. Although 
hardware-based logic such as this is less flexible than a microprocessor or other 
instruction-based processors, hardware-based logic is often much faster than 
instruction-based logic. 

In general, the disk controller operates as follows. Microprocessor 201 
handles communications with the host computer and coordinates all data transfers 
to and from the host controller. In addition, the microprocessor coordinates all 
actual disk transfers. However, data is buffered in memory 202 prior to writing to 
disk. Parity operations are performed on data in memory 202 under the control of 
microprocessor 20 1 . 

During initialization, microprocessor 201 constructs a coefficient subset 
table 212 in memory 202. Subsequently, when it is time for a parity operation, 
microprocessor 201 determines the classification and scenario of the parity 
operation. Once this information is determined, the microprocessor creates a 
script that indicates the locations in memory 202 of one or more data segments and 
parity segments (referred to collectively as data blocks 214) that will be the object 
of the parity operation. The script indicates an offset into the coefficient subset 
table 212 at which the proper coefficient subset will be found for the parity 
operation, and the number of coefficients that are contained in the coefficient 
subset. The script also indicates the location in memory 202 at which the result of 
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the requested calculation is to be placed. Each script stores information for a 
single parity operation and the memory structure for storing such scripts is referred 
to herein as a task description block (TDB). The TDB is stored in a particular 
location in memory 202 and a pointer to that location (e.g., a 32-bit address) is 
stored in a TDB queue 216 in memory 202. A response queue 218 is also stored in 
memory 202 and used by hardware logic 205 to return an indication to 
microprocessor 201 as to whether a parity operation was successful. 

When a script is placed in memory, the hardware logic is notified by the 
presence of the pointer to the TDB for the script in queue 216. The hardware logic 
responds by (a) retrieving the designated coefficients, data segments, and parity 
segments, (b) performing the appropriate parity operation based on the designated 
coefficients, (c) returning the data and/or calculated parity segments to memory, 
and (d) indicating to microprocessor 201 whether the operation has been 
successfully completed by writing an entry into response queue 218. 

The hardware logic is configured to perform the various different parity 
operations by summing products of coefficients and data/parity segments. The 
different operations actually vary only in the number and choice of coefficients, 
data segments, and parity segments. These variables are specified by the script. 
Thus, the operations lend themselves very conveniently to hardware-based 
calculations. 

jj^^ A simultaneously filed and /:o-pending U.S. Patent Application attorney 
docket no. 10001494, entitled 'XJsing Task Description Blocks To Maintain 
Information Regarding Operations' 1 , to inventors Barry J. Oldfield and Robert A. 
Rust, describes TDBs and their use, and is hereby incorporated by reference. 
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Other simultaneously filed and co-pepding U.S. Applications describe 
technologies useful in conjunction with the invention, including U.S. Patent 
Application attorney docket no. 10001489, entitled "Methods And Arrangements 
For Improved Strip-Based Processing^ to inventors Barry J. Oldfield and Robert 
A. Rust; U.S. Patent Application sfttorney docket no. 10001493, entitled "Memory 
Manager for a Common Memory, to inventors Barry J. Oldfield and Robert A. 
Rust; and U.S. Patent Application attorney docket no. 10001491, entitled 
"Efficient Parity Operations," to inventors Michael B. Jacobson, Barry J. Oldfield 
and Robert A. Rust;. Tnese applications are hereby incorporated by reference. 

Conclusion 

The parity calculation architecture described above has a number of 
advantages over the prior art. One significant advantage is that the architecture 
allows parity computations to be performed by hardware-based logic, without 
requiring significant complexity in the hardware. To provide this advantage, a 
microprocessor performs preliminary work such as designating the various 
parameters to be used in the calculations. Once the proper coefficients and 
data/parity segments have been designated, the hardware can perform the actual 
calculations in similar or identical ways, regardless of the particular type of 
operation that is requested. 

The pre-selection of coefficient subsets, and their arrangement in memory, 
provides further efficiencies by eliminating many steps that would otherwise be 
required to select coefficients prior to every parity operation. Furthermore, the 
unique indexing method, particular with regard to the described second section of 
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the array, allows the coefficients to be packed in memory to save space, even 
though the coefficient subsets have differing lengths. 

Although the invention has been described in language specific to structural 
features and/or methodological steps, it is to be understood that the invention 
defined in the appended claims is not necessarily limited to the specific features or 
steps described. Rather, the specific features and steps are disclosed as preferred 
forms of implementing the claimed invention. 
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